Skip to content

Conversation

@AnishPahilajani
Copy link

Making triton soft dependency so we can run GPTQ models

Added a try and except block on 2 imports

Related issue number

How to verify the PR

Was the PR tested

  • I have added >=1 unit test(s) for every new method I have added.
  • I have ensured all unit tests pass

@AnishPahilajani AnishPahilajani changed the title made triton a soft dependency Making triton a soft dependency Mar 11, 2025
@BrandonGroth
Copy link
Collaborator

BrandonGroth commented Mar 11, 2025

Triton is in the pyproject.toml as a hard dependency for hardware-independent matrix multiplication kernels. Do you have any example of how GPTQ was blocked?

@AnishPahilajani
Copy link
Author

AnishPahilajani commented Mar 12, 2025

Yes,
When running inference on Power/Spyre stack I get the following error

Failed to import addon packages: No module named 'triton'
Traceback (most recent call last):
  File "/tmp/aiu-fms-testing-utils/scripts/inference.py", line 211, in <module>
    raise Exception("GPTQ not enabled")
Exception: GPTQ not enabled

I dont think it matters what parameters I used, but here are the parameters:

  • 1 AIU card
  • input = 64 tokens
  • output = 64 tokens
  • batch_size = 1
  • running the granite gptq instruct model
  • gptq is enabled

@BrandonGroth
Copy link
Collaborator

BrandonGroth commented Mar 12, 2025

What is the status of installing Triton on Power? If it doesn't work, we can make Triton an optional package. We already have a framework for doing so.

@AnishPahilajani
Copy link
Author

When I try to install using pip.
pip install triton

I get the following error

ERROR: Could not find a version that satisfies the requirement triton (from versions: none)
ERROR: No matching distribution found for triton

@chichun-charlie-liu
Copy link
Collaborator

replaced by #78

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants